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Abstract 



C^ • We apply the method of characteristics for the solution of pde's to two combinatorial 

H ! problems. The first is finding an explicit form for a distribution that arises in bio- 

informatics. The second is a question raised by Graham, Knuth and Patashnik about a 
sequence of generalized binomial coefficients. We find an exact formula, which factors 
in an interesting way, in the case where one of the six parameters of the problem 
C^ I vanishes. We also show that the associated polynomial sequence has real zeros only, 

provided that one parameter vanishes, and the other five are nonnegative. 



The method of characteristics is well known in the theory of partial differential equations, 
and it is widely used for solving linear pde's of the first order. An introductory description of 
the method and a number of examples are in 0. In combinatorics the method has scarcely 
been used, though perhaps it deserves wider recognition. The main idea is that a linear 
recurrence in more than one variable will lead to a linear partial differential equation for 
the generating function of the solution of that recurrence, and the method of characteristics 
might well be able to yield the solution. We give two examples below. 

The first example is intended to introduce the method in a combinatorial context, and 
it concerns a rather special problem that was motivated by DNA sequence matching. The 
second example is drawn from a Research Problem in Graham, Knuth and Patashnik |2I, 
concerning a two- variable recurrence that generalizes the familiar Pascal triangle and thereby 
defines a family of generalized binomial coefficients. In the first example we obtain a complete 
answer to the question, using the method of characteristics. In the second example we obtain 
a complete solution also, though its form is very unwieldy. In an important special case, 
though, we can obtain the solution in a nice form. 

1 A problem from bio-informatics 

The following question has application to the problem of deciding if two DNA samples match 
sufficiently. It was asked by Warren Ewens, of the University of Pennsylvania. 
Let Qo{x) = 1 — x" and 

Qk+iix) = Qkix) + l^Q'kix) {k = 0,1,2,...). (1) 

It is required to find a nice form for Q^ and to study its properties. 
We will in fact show three nice forms for Qk{x). The first one is 

Qk{x) = 1 - E ("") (1 - xyx-~', (A; = 0, 1, 2, ... ) (2) 

i=o ^^^ 

Next we have, for A; = 0, 1, 2, ... , 

g,.,^-,.-.)(:)(-ir'E(„!,)^-W;;:r"^ '^) 



r>l 



But the nicest form is 



n— fc— 1 



Q,ix) = {i-xr' y: Ct ]^^- (4) 



j=0 



From this last form it is clear that 

lim Qkix) = 1 {\x\ < 1) 

n—>oo 

for every A; > 0. Thus for fixed fc > 0, |a;| < 1, (5fc(a^) is a cumulative probability distribution 
function oi n = 0,1,2,3,.... Indeed, keeping k and x fixed, the corresponding probability 
density function is 



Pn = U ) ^ (1 - ^) in = k,k + l,k + 2,...). 

To prove these things, put Q{x,t) = J2k>oQki^)^'^- Then the recurrence (JH) becomes, 
after the usual generating function operations. 

This linear partial differential equation of the first order can be solved by the method of 

characteristics. To do that we find two independent solutions of the ordinary differential 

equations 

dx dt dQ 

X- 1 ~ 1-t ~ ~Q' 

We easily find such solutions in the form 

(l-t)(l-x) =ci; (l-t)g = C2. 

Hence the general solution of the p.d.e. © is C2 = /(ci), where / is an arbitrary function, i.e., 
(l-t)Q(x,t) = /((l-t)(l-x)). The initial data Q(x, 0) = 1-x" yields /(ix) = l-(l-w)", 
hence 

Q(..i)^ ^-"-';:f-"" . 

After expanding the power above, the form Q follows. To get © we expand the power of 
1 — X in Q, collect powers of x, and observe that the coefficient of each power of x is a 
Vandermonde convolution sum that can be evaluated in closed form. 

Finally, we can verify form (@]) as follows. Expand the factor (1 — x)^^^ in (0)), to get 



j<n-k-\ ^ ■' 



i ^ ' j<n-k 



But by Gosper's algorithm the last summand is 

'k +1\ fk + j 



{-ly 



where 



9j 



r-J/ \ J 

{-iy+\k + jy.f k 



9j+i-9j, 



r{j — l)\k\ \r — jj 



Hence the inner sum is 



Qn—k 



rin — k — iy.k\ \r ~ n + k 



;-i) 



n-k+l 



n — k fn 



kj\r~n + kj^ 



and we have arrived back at form Q again, which completes the proof. 

A glance at form Q also shows that Qk{x) vanishes for all k > n. For example, when 
n = 5 the sequence {Qk{x)}^ is 

1 - x^ 1 - 5 x^ + 4 x^ 1 - 10 a;^ + 15 x^ - 6 x^ 1 - 10 x^ + 20 x^ - 15 x^ + 4 x^ 
1 - 5 X + 10 x^ - 10 x^ + 5 x^ - x^ 0, 0, 0, . . . 



2 The problem of Graham et al 

In j2], "Research problem" 89 of Chapter 6 asks for a general theory of recurrences of the 
form 



(an + pk + 'j) 



n — \ 
k 



+ {a'n + l3'k + -i') 



n — 1 
k-1 



+ ^nfi^kfi, 



(n, k>0) (6) 



which, of course, includes the recurrences for the binomial coefficients, the Stirling numbers 
of both kinds, the Eulerian numbers, and a variety of other combinatorial sequences. 

A number of contributions to this problem have been made by P. Theoret, in his disser- 
tation |1] and in two subsequent papers 0, 0- In jl] he considered both eq. (jH)), whose 
solutions he termed "special hyperbinomial sequences," as well as the generalization in which 
the coefficients of (jH)) are not required to be linear functions of n and k, which were called 
"hyperbinomial sequences." 

For technical reasons, we'll begin by rewriting their recurrence just slightly, as 



n + 1 
k 



{an + /3k + 'j) 



+ {a'n + (3'k + 7') 



n 
k-l 



{n,k > 0; 



^k,o) 



(7) 



2.1 The generating function 

Following the usual generatingfunctionological techniques, we introduce the gf 



^y)=Y. 



u[x, ^ ^ 

n,k>0 






It is then easy to translate the recurrence ^ into a linear partial differential equation of 
first order, viz. 

{I -ax- a'y)^4r^ = ^(^ + ^'v)^^^^ + (7 + (Z^' + Y)y>{^. v), (8) 

ox oy 

in u{x,y), together with the initial condition u{0,y) = 1. Such equations can in principle 
always be solved by the method of characteristics, though that method may require some 
tricks to make it work. Fortunately the Maple package PDEtools can save us the trouble of 
dealing with these tricks, and we can easily check that the results that it obtains are correct. 
We'll now do a few specific examples and then exhibit the general solution. 

• Binomial coefficients. With (a, /3, 7, a', (3\ 7') = (0, 0, 1, 0, 0, 1), the pdsolve command 
produces the general solution 

where F is an arbitrary function. Since m(0, y) = 1 = F{y), we have u{x, y) = e^^^^'^\ 

• Stirling numbers of the first kind. Now (a, /?,7, a', /5',7') = (1,0,0,0,0,1) yields the 
general solution 

u{x,y) = F{y){-l + x)-y, 

which together with u{0, y) = 1 gives the answer u{x, y) = {1 — x)"^, as is well known. 

• Stirling numbers of the second kind. Here (a,/?, 7, a', /?', 7') = (0,1,0,0,0,1), and 
the Maple package finds the general solution u{x,y) = F{ye^)e~'^. The condition 
u{0,y) = 1 chooses the answer 

u{x,y) = exp{y{e'' - 1)). 



2.2 A more general case. 

Take the case in which a' = but otherwise the remaining five parameters are unrestricted. 
Now the pdsolve command gets the general solution, and after applying the initial data 
u{0,y) = 1 the generating function for this case is obtained in the form 



u{x,y) 



'1 — ax) <y- 



1 + f 2/(1 - (1 - ax) 



_i!.\l+F 



(9) 



By expanding this in a power series we find a very explicit form for the generalized binomial 
coefficients, viz. 



^(fT(^0'S'-''-G)(^ 



h\\^ ) \p 



(10) 



in which x^ is the usual rising factorial. A close inspection of this result shows that it is the 
product of a polynomial in a, P, 7 only times a polynomial in 13', 7' only. We can make this 
more explicit by expanding the rising factorials into powers. This is a tedious calculation, 
but the result is that 



£,r,m '- 



k 



mj k 



E 



Jl+i2 = '= 



n 
n — ii 



n — ii\ \k + 12 
k + i2 



^il^i2Y3(^/)il(^')j2 



The factorization of the solution into a function of a,/?, 7 only, times a function of f3', 7' 
only, is 



E 

\il+i2+i3=n—k 



n 
n — ii 



n — ii\ f k + 12 



k + i2 



k 



a'^p'^f' 






The appearance of three kinds of delimited Pascal-triangle sequences, viz. both kinds of 
Stirling numbers and the binomial coefficients, in the first factor is striking. 

As an example we have, when a' = and the other five parameters are unrestricted. 



^lla^ + 18ai3 + 7p^ + 1807 + 12(3-/ + Q-f^){(3' + 7')(2/3' + 7')- 



Additionally, we discover a major difference between this case where a' = and the general 
case in that the general case does not exhibit such a factorization as above. For example, if 
all six of the parameters are unrestricted, we find that 



/2 



/2 



.2 „/2 



l2 „l nl 



18 a 7 a + 7 /3 7 a + 11 7^ a + 1% a' a 13' + 29 a 13 a' (5' + II (i' a p 



2 0/2 



32 nil 



+51 a-f a f3' + 29 (3j a' P' + 22 Y a (3' + 22a' f3' + 36 a f^ p' + U (^' l3 



3/2 



3/2 



„2 ,3/2 



32 „/,,/ 



+36 a 7 /3' + 24 /5 7 p' + 12 7^ /?'" + 18 a' a'Y + 29 a P a' 7' + 11 p' a' 7 



32 0/ ^f 



+44a7a'7' + 26/?7a'7' + 18 7^a'7' + 33a^/3'7' + 54a/3/?'7' + 21/?^/3'7 



,2, /2 



/2 



32„/2 



+54 a 7/3' 7' + 36 /37/5' 7' + 18 7^/?' 7'+ 11 a^ 7'' + 18 a/37'' + 7/3^ 7 
+ 18 a 7 7'^ + 12 /? 7 7' V 6 7^ 7'^ 

which does not factor as above. 

Next let's get the solution to (jH}, together with the initial condition u{0,y) = 1, for a 
completely general set of parameters (q;,/3, 7, a', /3',7'). The package pdsolve first returns 
the general solution u{x, y) in the form 



(/3 + P'yf^~'-^y-"^f -y-^l [P + P'yy-f'ax - 2F1 



a 



1 - - 

^ /3 



y- 



/3 



/3/3-]P 



:iii 



The condition ti(0, ?/) = 1 gives 
1 



/ 



a 



-y ^ 2F1 



/3' -'-"'" /3' /3 
1-^ 



/5' 
'7 



a a 

pj-W 



Tl + l+TL 7 



(/3 + /3'?/)-'^+^+^?//^. 



Hence let y = y{t) be the inverse function of the argument of / in the above. If we replace 
y by y(t) throughout, we obtain 



3+1+^ 



f{t) = {P + P'y{t)r^^'^J^y{t)^. 

With this choice of /, the general solution is given by (|lip. though in quite an unwieldy 
form. 



2.3 Reality of the zeros 

We will now study the polynomials 



X 



[n 



0,1,2,...), 



with a view to describing when they will have real zeros only. In the cases of binomial 
coefficients, both kinds of Stirling numbers, and others, this is true. We limit the discussion 
to the case where all of a,/?, 7, a', /3', 7' are nonnegative. Even in this case, however, it is 
not true that the polynomials must have real zeros only. If we take 

(a, A 7, a', /3', 7') = (3, 1,1, 0,1,0), 

for example, then 03 (x) = 2{x + l)(3x^ + 12x + 14), which has two nonreal zeros. 

If we multiply the "Pascal triangle" recurrence (I7|) by x'' and sum on k we find the 
recurrence for the 0„ polynomials as 

(pn+iix) = {{an + 7) + {an + /?' + 'y')x)(pn{x) + {(3 + f3'x)x(f)'n{x). (12) 

First suppose that P = P' = 0. Then (fT^ gives the explicit expression 

n-l 

(Pn{x) = Y[{{aj + 7) + {a'j + -f')x), 
i=o 

which plainly has real zeros only. 

Next, suppose /3' = 0, and /?, 7, 7' > 0. Let, for n fixed, 

u = a{n — 1) + 7, V = a'{n — 1) + 7'. 

In this case the <pn recurrence can be cast in the form 

J/3x|x|-"/^e-^^/^£ {|x|"/V^/^</)„_i(x)} , if a; ^ 0, n > 1; 
Ulj=o(7 + J"), ifx = 0, 

with (f)o{x) = 1. Now u,v,(3 are positive. If, inductively, 0n_i(x) has n — 1 negative real 
zeros then the function |x|"/^e*"^/^0ri_i(x) vanishes at those n — 1 points and at — cxo and 
at the origin. By Rolle's theorem, the derivative of this function must vanish at n points, 
whence (fin{x) has n negative real zeros. 

This shows that ii f3' = and a, a', (5 are nonnegative, and 7 and 7' are positive, then the 
zeros are all real and negative. By an obvious continuity argument, it must be that if /?' = 
and the remaining parameters are all nonnegative then the zeros are real and nonpositive. 
Hence we have. 

Theorem 1 ///?' = anda, P,'j,a',j' are nonnegative, then all of the polynomials {(f)n{x)}'^ 
have real zeros only. There are examples with (3' ^ in which non-real zeros exist. 



2.4 A question: the row sums 

We define the row sums of tfie generalized Pascal triangle, 



pH = Yl 



(n = 0,l,2,...). (14) 



In the classical cases these row sums are variously 2", n\, Bell(ra), etc. We ask for infor- 
mation about their asymptotic growth rate, under the assumption that a' = so that the 
explicit generating function Q is available. Indeed from (jH)) we have, by putting y := 1, the 
generating function of the row sums as 



n^) = Y.p( 



.X I i — ax) a 
n)— = 

'l + f(l-(l-ax)-l: 



From this, the exact formula for the row sums is 



M±7 1\ / /Q/\ -J-l-77/3' 



^'"'^-se;^)("^T-^)(-f) 



What is the asymptotic behavior of these row sums for large n? This will depend on the 
analytic character of F{x) above, and will no doubt require a careful case analysis. 
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